46 research outputs found

Single-Channel Speech Enhancement with Deep Complex U-Networks and Probabilistic Latent Space Models

Author: Anemüller Jörn
Nustede Eike J.
Publication date: 04/09/2023

In this paper, we propose to extend the deep, complex U-Network architecture for speech enhancement by incorporating a probabilistic (i.e., variational) latent space model. The proposed model is evaluated against several ablated versions of itself in order to study the effects of the variational latent space model, complex-value processing, and self-attention. Evaluation on the MS-DNS 2020 and Voicebank+Demand datasets yields consistently high performance. For example, the proposed model achieves an SI-SDR of up to 20.2 dB, about 0.5 to 1.4 dB higher than its ablated version without probabilistic latent space, 2 to 2.4 dB higher than WaveUNet, and 6.7 dB above PHASEN. Compared to real-valued magnitude spectrogram processing with a variational U-Net, the complex U-Net achieves an improvement of up to 4.5 dB SI-SDR. Complex spectrum encoding as magnitude and phase yields best performance in anechoic conditions, whereas real and imaginary part representation results in better generalization to (novel) reverberation conditions, possibly due to the underlying physics of sound.
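The abstract reports its results as SI-SDR (scale-invariant signal-to-distortion ratio) in dB. As a minimal sketch, assuming the commonly used definition in which the estimate is projected onto the reference before the energy ratio is taken, the metric could be computed as follows. The function name `si_sdr` and the toy signals are illustrative assumptions and are not taken from the paper, which may use a different variant (for example with mean removal).

```python
import math


def si_sdr(estimate, reference):
    """Scale-invariant SDR in dB (assumed common definition, no mean removal)."""
    if len(estimate) != len(reference):
        raise ValueError("estimate and reference must have the same length")
    # Optimal scaling of the reference that best explains the estimate.
    dot = sum(e * r for e, r in zip(estimate, reference))
    ref_energy = sum(r * r for r in reference)
    alpha = dot / ref_energy
    # Split the estimate into a scaled-target part and a residual part.
    target = [alpha * r for r in reference]
    residual = [e - t for e, t in zip(estimate, target)]
    target_energy = sum(t * t for t in target)
    residual_energy = sum(n * n for n in residual)
    return 10.0 * math.log10(target_energy / residual_energy)


# Toy signals (hypothetical): clean reference plus a small orthogonal error.
reference = [1.0, 0.0, -1.0, 0.0]
estimate = [1.0, 0.1, -1.0, 0.1]
score = si_sdr(estimate, reference)
# Rescaling the estimate should leave the score unchanged.
scaled_score = si_sdr([3.0 * e for e in estimate], reference)
```

Because of the projection step, a purely louder or quieter estimate is not penalised; only the residual that cannot be explained by rescaling the reference lowers the score.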

arXiv.org e-Print Archive

Complex Independent Component Analysis of Frequency-Domain Electroencephalographic Data

Author: Amari
Anemüller
Arieli
Bell
Berger
Bloomfield
Courchesne
Enghoff
Freeman
Hyvärinen
Jung
Jörn Anemüller
Kuhn
Lopez da Silva
Makeig
Nikias
Scott Makeig
Stein
Terrence J. Sejnowski
Torkkola
Zibulevsky
Publication venue: 'Elsevier BV'
Publication date: 01/01/2003

Independent component analysis (ICA) has proven useful for modeling brain and electroencephalographic (EEG) data. Here, we present a new, generalized method to better capture the dynamics of brain signals than previous ICA algorithms. We regard EEG sources as eliciting spatio-temporal activity patterns, corresponding to, e.g., trajectories of activation propagating across cortex. This leads to a model of convolutive signal superposition, in contrast with the commonly used instantaneous mixing model. In the frequency domain, convolutive mixing is equivalent to multiplicative mixing of complex signal sources within distinct spectral bands. We decompose the recorded spectral-domain signals into independent components by a complex infomax ICA algorithm. First results from a visual attention EEG experiment exhibit (1) sources of spatio-temporal dynamics in the data, (2) links to subject behavior, (3) sources with a limited spectral extent, and (4) a higher degree of independence compared to sources derived by standard ICA.

Comment: 21 pages, 11 figures. Added final journal reference, fixed minor typo.
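The abstract's statement that convolutive mixing becomes multiplicative mixing in the frequency domain can be checked numerically. The sketch below is an illustration only and is not the paper's algorithm: it uses a single source, a hypothetical three-tap mixing filter, and circular convolution so that the identity holds exactly under the discrete Fourier transform. With short-time spectra, as typically used in practice, the relation holds approximately per frequency band.

```python
import cmath


def dft(x):
    """Naive discrete Fourier transform of a sequence."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def circular_convolve(h, s):
    """Circular convolution of filter h with signal s (output length len(s))."""
    n = len(s)
    return [
        sum(h[tau] * s[(t - tau) % n] for tau in range(len(h)))
        for t in range(n)
    ]


# Hypothetical source signal and mixing filter (not from the paper).
source = [1.0, -2.0, 0.5, 3.0, 0.0, -1.0, 2.0, 0.25]
mixing_filter = [0.8, 0.3, -0.1]

# Time domain: the sensor signal is the source convolved with the filter.
mixed = circular_convolve(mixing_filter, source)

# Frequency domain: each bin of the sensor spectrum is the product of one
# complex filter coefficient and the corresponding source coefficient.
padded_filter = mixing_filter + [0.0] * (len(source) - len(mixing_filter))
lhs = dft(mixed)
rhs = [h * s for h, s in zip(dft(padded_filter), dft(source))]
max_error = max(abs(a - b) for a, b in zip(lhs, rhs))
```

With several sources and sensors, the scalar product per frequency bin becomes a complex matrix-vector product, which is the setting in which a complex-valued ICA algorithm operates on each band.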

arXiv.org e-Print Archive

CiteSeerX

Crossref

Evaluation and comparison of different machine learning approaches to auditory spectro-temporal receptive field estimation

Author: AF Meyer
Arne F Meyer
FE Theunissen
Frank W Ohl
Jan P Diepenbrock
Jörn Anemüller
MFK Happel
O Schwartz
Publication venue: BioMed Central
Publication date: 01/01/2011

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Dynamic behaviour of brain and surrogate materials under ballistic impact

Author: Anemüller Jörn
Bach Jörg-Hendrik
Caputo Barbara
Gool Luc Van
Havlena Michal
Hermansky Hynek
Jie Luo
Kayser Hendrik
Leibe Bastian
Motlicek Petr
Pajdla Tomas
Pavel Misha
Torii Akihiko
Zweig Alon
Publication venue: University of Canterbury. Mechanical Engineering
Publication date: 01/01/2008

Over the last several decades, the number of fatalities related to criminally inflicted cranial gunshot wounds has increased (Aarabi et al.; Jena et al., 2014; Mota et al., 2003). Back-spattered bloodstain patterns are often important in investigations of cranial gunshot fatalities, particularly when there is doubt whether the death is suicide or homicide. Back-spatter is the projection of blood and tissue back toward the firearm. However, the mechanism that creates back-spatter is not well understood. Several hypotheses describe its formation, but the internal mechanics are difficult to study in animal experiments because the head is opaque and sample properties vary from animal to animal. Ballistic experiments on human cadavers are rarely possible for ethical reasons. An alternative is to build a realistic physical 3D model of the human head, which can be used for crime scene reconstruction and BPA training. This requires a simulant material for each layer of the human head. To build a realistic model, it is necessary to understand the effect of each layer of the head on the generation of back-spatter. Simulant materials offer the possibility of safe, well-controlled experiments. Suitable simulants must be biologically inert, be stable over a reasonable shelf life, and respond to ballistic penetration in the same way as the corresponding human tissues. Traditionally, 10-20% (w/w) gelatine has been used as a simulant for human soft tissues in ballistic experiments. However, 10-20% gelatine has never been validated as a brain simulant. Moreover, due to the viscoelastic nature of the brain, it is not possible to find the exact mechanical properties of the brain at ballistic strain rates.
Therefore, in this study several experiments were designed to obtain qualitative and quantitative data using high-speed cameras, comparing different concentrations of gelatine and a new composite material with bovine and ovine brains. Factors such as the form of fragmentation, velocity of the ejected material, expansion rate, stopping distance, absorption of kinetic energy, and the effect of suction and ejection of air from the wound cavity and its involvement in the generation of back-spatter have been investigated. Furthermore, a new composite material has been developed that creates a more realistic form of fragmentation and expansion rate than any of the gelatine concentrations tested. The results suggest that none of the gelatine concentrations used in this study was capable of recreating the form of damage observed in bovine and ovine brain. The elastic response of brain tissue is much lower than that observed in gelatine samples. None of the simulants reproduced the stopping distance or form of damage seen in bovine brain. Suction and ejection of air resulting from the creation of the temporary cavity are directly related to the elasticity of the material. For example, reducing the gelatine percentage increases the velocity of the air drawn into the cavity, whereas the reverse is seen for the ejection of air. This study showed that the elastic response of brain tissue was not enough to eject brain and biological material out of the cranium. However, the intracranial pressure rises as the projectile passes through the head. This pressure has the potential to eject brain and biological material backward and create back-spatter. Finally, the results suggest that for each specific type of experiment, a unique simulant must be designed to meet the requirements of that particular experiment.

UC Research Repository

Publikationsserver der RWTH Aachen University

Archivio della ricerca- Università di Roma La Sapienza

Correlated modulation: a criterion for blind source separation

Author: Jörn Anemüller
Publication venue: Joint meeting of Acoustical Society of America and European Acoustics Association

Summary: The problem of blindly separating a convolutive mixture of modulated signals is considered. Spectrograms of the signals are computed and separation is performed in the frequency domain. A new algorithm for blind source separation is proposed, which is based on correlated modulation in the sources' different frequency channels. For example, speech contains correlated modulation in different frequency regions. The algorithm successfully separates mixtures of modulated artificial signals and of speech.

CiteSeerX

Deep network source localization and the influence of sensor geometry

Author: Anemüller Jörn
Schoof Hendrik
Publication venue: Deutsche Gesellschaft für Akustik
Publication date: 01/01/2019

Publikationsserver der RWTH Aachen University